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Abstract 

Let S* be a cyclic n-ary sequence. We say that S is a universal cycle 
((n, k)-Ucycle) for fc-subsets of [n] if every such subset appears exactly 
once contiguously in S, and is a U cycle packing if every such subset 
appears at most once. Few examples of Ucycles are known to exist, 
so the relaxation to packings merits investigation. A family {Sn\ of 
(n, A;)-Ucycle packings for fixed is a near-Ucycle if the length of Sn 
is (1 — o(l))(^). In this paper we prove that near-(n, /i;)-Ucycles exist 
for all k. 
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1 Introduction 



A universal cycle (U cycle) for k-suhsets of [n], denoted (n, A';)-UCS, is a cyclic 
sequence of integers from [n] = {1,2, ... ,n} with the property that each k- 
subset of [n] appears exactly once consecutively in the sequence. For example, 
1234524135 is a universal cycle for pairs of [5]. Chung, ct al. [1], defined 
universal cycles for a general class of combinatorial structures, generalizing 
both deBruijn sequences and Gray codes (see [7]). The books [5, 6] contain 
a wealth of information about generating such things efficiently. 

A necessary condition for the existence of an (n, A;)-UCS is that n \ (^) . 
This is because symmetry demands that each symbol appears equally often. 
The authors of [1] conjectured that the necessary condition is sufficient for 
large enough n in terms of k (some evidence [4] suggests n > k + 3) and 
offered $100 for its proof. ^ 

Conjecture 1 [1] For all k > 2 there exists no{k) such that for n > no{k) 
U cycles for k-subsets of [n] exist if and only if k divides (^) . 

Progress on the conjecture has been slow. The k — 2 case is trivial, 
corresponding to the existence of eulerian circuits in Kn if and only if n is 
odd. Jackson [3] proved the conjecture for k = 3, and constructed ucyles for 
k — 4 and odd n, leaving the case n = 2 mod 8 unresolved. In [2] we find 
the following result. 

Result 2 [2] Let no(3) = 8, no(4) = 9, and no(6) = 17. Then {n,k)-UCs 
exist for k = 3, 4, o-nd 6 with n > no{k) and gcd{n, k) = 1. 

Note that no{k) — 3k suffices for k G {3, 4, 6}. It would be nice to lower 
3k as much as possible. To this end, Stevens et al. [8] proved the following. 

Result 3 [8] No {k + 2, k)-UC exists for k>2. 

Combined with particular computer examples found by Jackson [4] (e.g. 
{n, k) — (10, 4)), this suggests that no{k) — k + 3 may suffice. 

^At the 2004 Banff Workshop on GeneraUzations of de Bruijn Cycles and Gray Codes 
it was suggested by the second author that a modest inflationary rate should revalue the 
prize near $250.04. 
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In cases where no Ucycles have been found, or none exist, it is natural to 
look for cycles with as many distinct subsets as possible; that is, a U cycle 
packing, such as the sequence 

S = 1345682 4678135 7812468 2345713 5678246 8123571 3456824 6781357 

for (n, k) = (8, 4). Note that no (8, 4)-UCS exists, and that S accounts for 56 
of the possible 70 subsets. One might notice that these blocks shift upward 
by 3 mod 8 from one to the next. This is important in the techniques that 
follow. 

In their paper, Stevens, et al., show that the longest possible packing 
of a (A; + 2, A;)-UCS has length k + 2 and a packing achieving this bound 
always exists. Compared to the potential {^^^^ length, this shows that for 
n = A; + 2, we cannot get close to to a full universal cycle. To estabhsh 
this notion formally, we define a near-Ucycle packing as a sequence of Ucycle 
packings, one for each n, such that as n — oo, asymptotically few /c-subsets 
are omitted from the {n, A;)-UCS packing. That is, the length of cycle Sn is 
(1 — o(l))(^). For example, if n is even and M is any perfect matching in 
Kn then — M is eulerian. In particular, any eulerian circuit is a near- 
(n,2)-UCS of length (1 — (2). The purpose of this paper is to prove that 
near-Ucycles exist for all k. 

Theorem 4 For all k, near-{n, k)-Ucycles exist. 

We proceed in the proof of this theorem by analyzing the construction 
of universal cycles. As we will show, we can create a Ucycle packing by 
selecting only those subsets that avoid certain structure. We prove that 
there are asymptotically few such structured subsets. 

2 General Technique 

The general technique used to construct Ucycles originated from [3]. It con- 
sists of classifying the component subsets by their structure and ordering 
them accordingly. We write the /c-subset S of [n] as S = {si, . . . , Sk}, with 
Si < Si+i, and define the form of S as F = (/i, . . . , fk) by fi = Si+i - s,, 
where indices are modulo k and arithmetic is modulo n. That is, the form of 
a set is the ordered collection of distances between set elements, (see Figure 

1) 
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Figure 1: Form visualization of {1, 3, 5} 



Consider the following example. For {n,k) = (8,3), the set {1,3,5} 
has form is (2,2,4). We consider the cyclic permutations of a form to be 
equivalent, so (2,4,2) and (4,2,2) can both serve as the form of {1,3,5}. 
The choice of form has to do with how the form appears in the cycle. For 
example, the form F = (4,2,2) makes the sets {1, 3, 5}, {2, 4, 6}, . . . , and 
{8, 4, 2} appear as 513, 624, . . . , and 482, respectively. Note that the last 2 
in F is not used for these sets, and so we may represent F as (4, 2; 2), or 
more simply (4,2). In our techniques below, we will restrict our attention 
to forms (/i, . . . , /fc_i; fk) having unique fk- In fact, we choose fk to be the 
largest unique entry. 

It is important to note that the method used here to construct the forms 
is somewhat limited. As part of our definition, every form has entries whose 
sum is n. We call such forms simple. However, it is possible to relax this 
condition. For example, the set {5,3,1} would have the form (6,6,4). If 
we allow the sum of form entries to be a multiple of n, we would have more 
freedom in representing the subsets, and hence a better method of construct- 
ing Ucycles could be developed. These forms we define as crossing. In this 
paper, we will only use simple forms. 

The purpose of the forms is to model what occurs in a Ucycle, namely, 
that if sqSi . . . Sk appears in a Ucycle then the forms (/i, . . . , fk-i) and 
(/2, . . . , /fe) of the sets {sq, • • • , Sk-i} and {si, ...,Sk} overlap on (/2, . . . , fk-i). 
This motivates the following definitions. 

For a given form rep. (/i, . . . , fk-i), we define (/i, . . . , fk-2) to be its 
prefix, and (/2, . . . , fk-i) to be its suffix. 

The transition graph, denoted %i,k, is a graph whose vertices are the 
prefixes (and suffixes) of the form representations of (n, A;)-UCS. The directed 
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Figure 2: Transition Graphs 



edges are the form representations, drawn from prefix to suffix. In order for 
this construction to produce a Ucycle, it is necessary that this transition 
graph be Eulerian. 

Consider the cases of {n,k) = (8,3) and {n,k) = (10,4), and the forms 
of each: 

We use these forms to construct Tg^s and '7io,4, as shown in Figure 2. 

The condition of evenness is highhghted by these two examples. We know 
that a Ucycle is possible for {n,k) = (8,3) but not for {n,k) = (10,4). As 
shown in the following result, this is directly connected to the fact that % 2 
is Eulerian and 7io,4 is not. 

Result 5 [2] If Tn,k is Eulerian for some choice of form representations, 
then an {n, k)-UCS exists. 

As we can see in Tlo,4, the vertex 44 has no out degree. This is not 
necessarily the case. If we write the form (1,1, 4; 4) as (4, 4, 1;1), then we 
would instead connect 44 — 41. Since this form has no unique entry, this is 
left ambiguous. We call such forms bad. We define all forms with at least 
one unique element, a clear representative, as good. 

Consider the previous example. We can ignore all bad forms of (n, k) = 
(10,4), and construct the transition graph, as shown in Figure 2. 

In this case, ignoring the bad forms yields an even transition graph. As 
stated in the following result, this is true for any connected transition graph. 



5 




Figure 3: Good Tlo,4 



Note, however, that it is not Eulerian since ((33)) is not connected. This 
problem disappears if we also ignore all such isolated cycles, as stated in 
the following result. As we will show, the number of such isolated cycles is 
negligible as n — > oo. 

Result 6 [2] If 7in,k is connected and there are no bad forms for k-subsets 
of [n] , then T^^k is Eulerian. 

We can construct a Ucycle packing by ignoring such bad forms for any 
(n, k) pair and restricting our attention to the largest component of 7^,^, the 
component containing ((1 ... 1)). 

3 Main Result 

In order to prove that a near Ucycle packing is possible for any /c, we must 
show that an asymptotically large proportion of sets can be included in a 
Ucycle. As we will show, the good sets, those belonging to good forms, can 
easily be included in a Ucycle packing, while the remaining bad sets cannot. 
As we have indicated, we will only include good sets. Of course, not all good 
sets can be included. It therefore remains to show that the good sets which 
can be included asymptotically outnumber all other sets, and that a Ucycle 
packing that includes each of these sets can be created. 
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3.1 Counting Good Sets 

Consider the example of the (8, 4)-UCS. We know that each form must have 
four entries, and, since each form is simple, the sum of these entries must be 
eight. The number of times each entry appears is also important, since the 
bad forms will have no entry appearing only once. Therefore, wc do not need 
all of its entries in order to determine whether or not a form is bad. We need 
only to know how many times each entry appears. For example, the form 
(1, 2, 2; 3) has the unordered pattern (1, 2, 1) = (2, 1, 1), essentially the hst of 
multiplicities. ^ Since this pattern contains a 1 as an entry, it has a unique 
element and therefore all forms with this good pattern are good. In this case, 
the pattern entries define a partition of 4, since each form has 4 entries. In 
general, every pattern of an (n, /c)-UCS is a partition of k. 

Consider a bad pattern P — (pi, . . . ,pt). Clearly, Pi >2 for all 1 < i < t, 
so there exists a corresponding good pattern P' = {pi, ■ ■ ■ ,pt — 1, !)• We 
define the function : F — > from the set of of good patterns to the set of 
bad patterns as 7i{P) = P'. Since this map applies to any bad pattern, we 
can see that < |F|. ^ 

As we can see, many forms may belong to the same pattern. In fact, the 
forms belonging to a particular pattern satisfy the equation X^Li Pi^-i ~ 
for some positive integers Xi,...,Xt, where {pi,P2, ■ ■ ■ ,Pt) ai'e the pattern 
entries. However, this equation imposes no order on the solution, and thus 
does not distinguish between two different forms that share the same entries. 
We say that two such forms belong to the same class. 

For example, the forms (1, 2, 2; 3), (2, 1, 2; 3), and (2, 2, 1; 3) both share the 
class [1, 2, 2; 3]. By convention, we order the entries of a class from smallest 
to largest entry. 

As we know, the classes belonging to a pattern P are all representations 
of n as a positive distinct integer linear combination oi pi, . . . ,pt. We count 
the classes with the aid of Shur's theorem. For P — (pi, . . . define the 
function 



^We can also denote the form (1, 2, 2, 3) as (3^2^!^), a notation we will find nseful later 
■^Note that the number of good partitions of k is equal to the number of partitions of 

k — 1, denoted p{k — 1). Since p{k) ~ e'^V^^^/4-\/3A; ~ p{k — 1), almost all patterns are 




good. 
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Theorem 7 [9J Suppose that gcd(pi, ■ ■ ■ ,Pt) — 1; cind define k — Yl*j=iPj 
and Q — Yl^^^Pj. Then 



HP) 



{t-l)\Q 



Counting the classes requires a slightly more general result. Namely, we 
want to extend Schur's theorem as follows. We define 



[xi, ...,xt) I ^PjXj = n, Xj > 1, Xj XiWiy^ j 



Lemma 8 Define g — gcd{pi, . . . ,pt), k — Yl]=iPj> '^'^^ Q — 115=1 Pj- Then 

ni,'(p\ - ^)*"' _ {n-ky-^ ^ PiPj 

^ ^ (i-l)W-^ (t-2)!Q5*-2^p,+p,. ■ 

Proof. 

Since all form entries are positive integers, we count only integer solutions 
to '^^j^iPjXj = n such that Xj > 1. This is equivalent to finding all integer 

solutions to the equation Yl]=iPji^j + 1) = Yl]=iPj^j + k = n, without the 
positivity constraint. 

We further modify the system to act for the condition gcd{pi, ■ ■ ■ ,Pt) — g- 
The equation ^ Sj=i Qj^^i = is the same as our original 

equation, and clearly gcd{qi, . . . ,qt) = 1. 

Finally, we need to account for the fact that for a pattern of size t, 
each corresponding class has exactly t distinct entries. If, for example, 
P — (2, 1, 1), then C = [1, 1, 3, 5] is one possible class. However, [3, 3, 3, 1], 
although it is a valid solution to Yl^=iPi^i = does not fulfill the re- 
quirement that Xi 7^ Xj for all i ^ j. This is easily seen if we adopt the 
notation C = [xf , . . . , ]. Written in this way, [1,1,3,5] = [123^5^] and 
[3,3,3, 1] = [3^3^1^] In the latter case, Xi — xi-, so the class [3^3-'^l-'^] is not 
valid. Therefore, we must account the number of such non-distinct solutions. 
If Xi = Xk, then the equation reduces to {pi + pk)xi + J2]=ijj^i,jj^kPj^j = ^• 
Wc can then apply Schur's theorem to this equation as we did before. For 
the new system, Q' = (pi + Pk) H^^^j^ij^^Pj = so the total number 

of non-distinct solutions to this equation is ~ (/— 2)ugg*-2 Sj<j p +p ■ ' 
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In order to count the classes, we now apply Lemma 8 to each pattern. 
We want to show that as n gets large, a given bad pattern P has far fewer 
classes than its good component 4>{P)- First, we must determine the size of 
Ei<i ^"^y pattern P. 

For any Pi, we know that ^ = for some c < k, which is max- 

Pi ~rPj C 

imized when pi = c/2. If pi = c/2, then X^j=iPi= ^= k. Thus c = ^. 
Since each pi = c/2, we maximize the value of this expression when pi = k/t. 

Thus, E.<,^< Qlf-4l = f <^V4. 

Using this upper bound, we calculate that the number of classes belonging 
to a pattern P is 

{n-ky-^ {n-kf-'^ {n-kf-^ 



(t - ly.Qg^-' (t - 2)\Qg^-' 4 (t - l)\Qg^-' ' 

With this application of Schur's theorem, we can count the number of 
classes belonging to good patterns compared to the number of classes be- 
longing to bad patterns as follows. 

c(P) _ (n - ky-' t\ UT=\ pM - 1) . . t{pt - 1) ^ Q 



c(0(P)) {t-iy.g'-^UUP^ i^-ky Ptn 

That is, the good classes asymptotically outnumber the bad classes. 

It still remains to show that the number of forms of a bad class C be- 
longing to a bad pattern P is no greater than the number of forms of a good 
class C of (f){P). As we know, the forms of a class are all permutations of 
the class entries modulo cyclic rotation. Therefore, each class C G P has 
Jri'^^' , forms, while classes C e MP) has ''[, — -r forms. That is, the 

good forms outnumber the bad forms by a factor of kpt. 

Finally, it remains to count the sets. By definition, all good forms have 
at least one unique element. Thus the good sets have at least one unique 
difference between two elements. This implies that every good form contains 
exactly n sets. ^ Therefore, the number of sets per good form is at least the 
number of sets per bad form. 



^The bad forms do not have a unique element, so this is not always the case for bad 
sets. Instead, we only know that the numbca- of sets contained in a bad form is a factor 
of n. (Symmetry can reduce the number of sets; e.g. (3) has forms (1, 4, 7), (2, 5, 8), and 
(3,6,9) when n = 9.) 
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By counting the patterns, classes, forms and sets, one can see that almost 
every set is good. However, it remains to be shown that asymptotically many 
of these good sets can be arranged into a Ucycle packing. We will use the 
following lemmas to prove that this is true, and therefore a near Ucycle 
packing is always possible. 

Lemma 9 IfTn^k is restricted to the good classes then it is a union of cycles, 
and hence even. 

Proof. For a given class C and transition graph 7^ j.) we define the graph 
%i,k{C) to be the restriction of Tn^k to the edges belonging to the forms of C. 

If the class C is good, then Tn^k{C) is a cycle or union of cycles. Each 
form of C has a unique representative c/j, thus for any permutation F of 
{ci, . . . , Cfc_i}, all cyclic permutations of F are also forms of C. Since Tn^k{C) 
is a union of cycles for each good C, it is clear that 7^,^ restricted to the 
good classes will be a union of Eulerian subgraphs, and is therefore even. □ 

Corollary 10 // the restriction of Tn^k to good classes is connected, then it 
is Eulerian. 

Proof. Since the restriction of T^ k to good classes is a union of cycles, it is 
easily seen that if T^^k is connected, it is surely Eulerian. □ 
By the result in [2], if 7^,^ is Eulerian, then an (n, /c)-UCS exists. There- 
fore, if we can show that the restriction of 7^,^ to good classes is connected, 
then it follows that a Ucycle packing exists that includes all good sets. How- 
ever, Fig. 2 demonstrates that this is not always the case. Instead, we prove 
that an asymptotically large component is connected. Since we have proven 
that (1 — o(l))(^) sets are good, a Ucycle packing that includes all good sets 
is a near-packing. 

3.2 Finding a Large Component 

In order to study the components of the restriction of %i^k to good classes, we 
define the class graph, denoted Hn,k, as the undirected graph whose vertices 
arc all classes of (n, /c)-UCS . An edge is drawn between the class represen- 
tations that differ by only one entry. For example, [1,2, 2; 5] and [1,1, 2; 6] 
are connected in 7iio,4. 
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If Ci and C2 are connected in T-Cn,k, then this means that Tn^k{Ci) and 
Tn.kiC2) will also share vertices. For example, [1,2, 2; 5] and [1,1, 2; 6] are 
connected in Hio,4, and correspond to the cycles 

((12)) ((22)) ((21)) ((12)) 

and 

((11)) - ((12)) ^ ((21)) - ((11)) . 

Just because Ci and C2 are connected in Tn, A;, this does not guarantee 
that the union of ^,^((^1) and 7^,fc(C2) will be connected. It could happen, 
for example, that each Ci has two components that connect, resulting in two 
components for Ci U €2- However, as proven in [2], if 'Hn,k is connected, then 
the union over all classes produces a connected 7^,^- We will clarify this with 
the map defined as follows. 

Let C — [d^ , ■ ■ ■ , c^ll^ ; Ct] be a class, where the entry q appears pi times, 
and Ct is the largest singleton. To connect the class good classes, we define the 
map K : c(r) ^ c(r) by [c?\ . . . , c^r/; q] [1, . . . , c^l^"'; Ct + c^^i - 1] 
We then apply k again, each time adding another 1. 

For example, k connects the class [2, 2, 2; 4] to the class [1, 1, 1; 7] of 7^10,4 
by the path 

[2, 2, 2; 4] ^ [1,2, 2; 5] ^ [1,1, 2; 6] ^ [1,1,1; 7]. 

In this way, we are able to connect the majority of the classes to the class 
[1, . . . , 1; A; — t + 1]. However, using k, docs not work for every good class. For 
example, the class [3, 3, 3; 1] is surely good. However, k. maps [3, 3, 3; 1] to 
[1, 3, 3; 3]; that is, to itself. In general, if Ct is the largest singleton of a class 
C, then Ct + Ct_i — 1 will be the largest singleton of k(C) only if ct > 1. 

3.3 Counting Awesome Sets 

In order to circumvent the problem introduced above, we restrict our at- 
tention to the awesome classes, the good classes whose largest singleton is 
greater than 1. Using Schur's theorem, we can show that the number of 
classes that are not awesome is negligible, as shown below. 

t 

{xi,...,xt) I ^PjXj = n, 1 = xi = pi < p2 < • • • < Pt, Xj>l 
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I j=2 
I j=2 

{n-2k- 
(t-2)!^*-2g 

« n'-' 
for n > 2k. 

Since the number of non-awesome classes is negligible compared to the 
number of total classes, we can disregard them and restrict our attention to 
the awesome classes, which still comprise an asymptotically large proportion 
of all subsets. 

3.4 Proof of Theorem 4 

Since almost all classes are awesome, we know that each awesome class is 
connected to [1, 1, . . . , 1; n — A; + 1] in T-Cn,k- Since the awesome classes of Tin.k 
are connected, the restriction of 7^ ^ to the awesome classes is also connected. 
If the restriction of T^^k to awesome classes is connected, then by lemma 9, 
we know that all awesome sets, the sets belonging to awesome classes, can 
be connected to form a Ucycle packing. Since, as shown above, the awesome 
sets represent an asymptotically large proportion of the total sets, it follows 
that this Ucycle packing is a near Ucycle packing. □ 

4 Remarks 

Many of the techniques presented here can be extended to other forms of 
Ucycle approximations. For example, it is possible to include any set in 
a Ucycle packing by simply inserting the set elements anywhere in the cy- 
cle. We could therefore construct a Ucycle coveringhj simply appending all 
non-awesome sets onto a near Ucycle created using the techniques we have 
described. However, this is very inefficient because it increases the length of 
the cycle by k for each added set instead of the desired 1. In order to find 
a more elegant method of constructing Ucycle coverings, more complicated 
analysis is required. As stated earlier, each bad form may only produce a 



n - 1, 1 < p2 < ■ ■ ■ < Pt, Xj > 1 



n-2k-l, 1< p2 < • • • < Pt, > 
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factor of n sets. Therefore, the method used to connect awesome sets into the 
Ucycle will not work, since traversing an Eulerian transition graph n times 
is impossible for many bad forms. 

One possible method of connecting bad sets in a Ucycle is to consider 
multiple form classes. Currently, for a form F — {fi, . . . , fk), we require 
Yli=i fi = ^- However, it could be useful to consider Yli=i fi ~ for a > 1. 
This would allow much more freedom in representing sets, and therefore more 
ways of connecting sets. 

Finally, due to our proof that near Ucycles exist, we believe that we 
deserve asymptotically much of the prize money, or $[1 — o(l)](250.04). Since 
we do not know the speed of the o(l) term, we have made a conservative 
estimate of $249.99. 
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